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Abstract 

Consider decision problems derived from optimization problems. For problems in this 
category, we show that they cannot be expressed in existential second order (ESO) logic 
with a first order part that is universal Horn (or simply, ESO ITi Horn) at the structure 
level, as opposed to machine level. This is true even when we consider ordered structures. 
We show that an NP-complete problem (Maximum Independent Set) and a polynomially 
solvable problem (Maximum Matching) have identical ESO Hi (non-Horn) expressions 
when the only unknowns are the second order predicates, hence this logic is too weak to 
distinguish between the two classes. At the structure level, Fagin's theorem is false in 
one sense, in that ESO universal sentences under ordered structures cannot capture the 
class NP. Also at the structure level, under ordered structures, least fixed point logic 
with counting (LFP+C) is insufficient to capture P, the class of problems decidable in 
polynomial time. 

Keywords. Descriptive complexity, Optimization, Existential second order logic, Least fixed 
point logic, Machine level and structure level. 

AMS classification. 90C99, 68Q19, 68Q15, 68Q17, 03C13. 



1 Introduction 

We first introduce some notation and definitions. For the contributions of this article, see 
Sec. O 

We work with decision versions of optimization problems. The reader is assumed to have 
some background in Finite Model Theory. If not, the book by Ebbinghaus and Flum [3] 
serves as a good introduction. Publications [8] and [13] are also relevant material for this line 
of research. 

Definition 1. Let P (NP) be the class of problems decidable in time polynomial in the size 
of the input deterministically (non-deterministically), respectively. 



Definition 2. A P- optimization problem Q is a tuple Q = {Iq, Fq, /q, opto), where 

(i) Iq is a set of instances to Q, 

(ii) Fq(I) is the set of feasible solutions to instance I, 

(Hi) /q(I, 5*) is the objective function value to a solution S G Fq{I) of an instance I G Iq. 
It is a function f : U/gJq[{-0 x ^qCO] — ^ (non-negative reals Jj, computable in time 
polynomial in the size \A\ of the domain A o/Jp, 

^zuj For an instance I & Iq, optQ(I) is either the minimum or maximum possible value that 
can be obtained for the objective function, taken over all feasible solutions in Fq(I). 

optQ(I) = max /q(I,S) (for P -maximization problems), 
optQ(I) = min /q(I, S) (for P- minimization problems), 

(v) The following decision problem is in the class P: Given an instance I and a non-negative 
constant K, is there a feasible solution S G Fq(I), such that /q(I, S) > K (for a P- 
maximization problem), or /q(7, S) < K (in the case of a P -minimization problem)? 

And finally, 

(vi) An optimal solution S op t(I) for a given instance I can be computed in time polynomial 
in \I\, where optQ(I) = /q(J, S op t(I))- ■ 

The set of all such P- optimization problems is the P op t class. 

A similar definition, for NP-optimization problems, appeared in Panconesi and Ranjan (1993) 

Definition 3. An NP-optimization problem is defined as follows. Points (i)-(iv) in Def. 
[H above apply to NP-optimization problems, whereas (vi) does not. Point (v) is modified as 
follows: 

(v) The following decision problem is in NP: Given an instance I and a non-negative constant 
K, is there a feasible solution S G Fq(I), such that /q(/, S) > K (for an NP -maximization 
problem), or /q(I, S) < K (in the case of an NP -minimization problem) ? 

The set of all such NP-optimization problems is the NP op t class, and P op t ^ NP op t. 
Definition 4. Decision versions. 

See Def. Given a non-negative constant K and an instance I G Iq, the decision version 
of a P- optimization problem Q asks whether there is a feasible solution S G Fq(I), such that 
/q(7, 5) > K (if Q is a maximization problem), or /q(I, 5) < K (if Q is a minimization 
problem). 

Similarly, decision versions of NP-optimization problems are derived from Def. ■ 
1 Of course, when it comes to computer representation, rational numbers will be used. 

2 Strictly speaking, we should use |/| here, where |7| is the length of the representation of /. However, |7| 
is polynomial in |j4|, hence we can use \A\. 
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As we will see in Sections I2.3H2.51 the decision versions can be expressed as a conjunction of 
an objective function constraint (OFC) and basic feasibility constraints (BFC). 

Assumption 5. We consider ordered structures only. 

Definition 6. ESO IIi Horn is defined as existential second order (ESO) logic where the 
first order part is a universal Horn formula, 

We need the following result. See [7J, or Theorem 3.2.17 (Page 147) in [SJ, or Theorem 9.32 
(Page 154) in [13]. 

Theorem 7. If a decision problem P\ can be represented by an ESO II i Horn sentence, then 
the second order quantifiers can be determined in polynomial time f PTIME,). Hence P\ can 
solved in PTIME. 

This follows from the fact that HornSAT (satisfiability of a propositional Horn formula) can 
be solved in ptimeU [15] . 

The goal is to determine values for the second order quantifiers (the unknowns in the opti- 
mization problem) in ptime; this would imply that the problem represented by the existential 
second order (ESO) sentence is solvable in ptime. 

Assumption 8. We work only with undirected graphs. If there is an edge between vertices u 
and v where u < v, we set E(u, v) = true and E(v, u) = false. 

1.1 More about BFC and OFC 

We represent decision versions of optimization problems as a conjunction of a single objective 
function constraint (OFC) and a set of basic feasibility constraints (BFC). The motivation 
for this comes from a general mathematical programming framework where optimization 
problems are expressed in the form: 

Maximize (or Minimize) /(x), 
subject to the following constraints: 

< 0, 1 < i < m, (1) 
^(x) = 0, l<j<p, 
and x G X (for example, X = W 1 ). 

In ©! /( x ) is the objective function. Additionally, for the decision problem, we are given a 
constant K, a non-negative integer. 

Above, the BFC comprises the m + p constraints <?j(x) < and hj(x) = 0, and the OFC com- 
prises the single constraint /(x) > K for maximization problems (/(x) < K for minimization 
problems) . 

From Theorem [7] and the fact that decision versions of optimization problems can be expressed 
as conjunctions of OFC and BFC, one would expect that for problems in the class P, the 

3 In this paper, we only use ptime as a shorthand for polynomial time, not to represent the problems in the 
class P. 
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OFC and the BFC can be expressed as ESO IIi Horn sentences. However, in Sec. [H we will 
show that this is not true. In addition, we observed the following: 

BFC. Interestingly, we found that the BFC for some NP-complete problems (Maximum 
Clique, Maximum Independent Set and MaxSAT) can also be expressed as ESO Hi Horn 
sentences^- Gate and Stewart [6] also observed this phenomenon when they derived a BFC 
in ESO Hi Horn form for another NP-complete problem, MaxHorn2SAT. 

OFC. Decision versions of optimization problems, no matter what their complexity is, can 
be grouped according to the arity of the second order predicate (say F) which measures the 
objective function value. The objective function value is equal to \F\. For maximization 
(minimization) problems, the OFC is \F\ > K (\F\ < K) respectively, where K is part of the 
input. 

For instance, we will see in Section [2] that a problem in P (Maximum Matching) and two 
NP-complete problems (Maximum Clique and Maximum Independent Set) have identical 
OFC's, since the arity of the objective function predicate F is equal for all three problems 
(equal to one). 

1.2 Our contribution 

A few observations are in order: 

• Descriptive Complexity is a very useful tool at the structure level, not at the machine 
level. 

• A large subset of problems in the class NP is derived from optimization problems, hence 
this is an important class. 

To elaborate on the first item above: The usefulness of Descriptive Complexity lies in the fact 
that if one could write a structure level expression in a certain logic for a decision problem 
V, then it can be deduced that T> requires a certain amount of resource (time and/or space) 
for its computation; that is, we can recognize the complexity class that T> belongs to. 

However, our main result shows that even though a large class of problems (those derived 
from optimization) is ptime solvable, they cannot be expressed in ESO universal Horn logic 
at the structure level. Hence for such problems, the usefulness of Descriptive Complexity is 
lost|, using this particular logic. 

Section [2] We explain the distinction between expressions in ESO Hi Horn logic at the ma- 
chine level and the structure level. We provide structure level expressions for the BFC's of 
two problems in the class NP, and one in the class P (the corresponding optimization ver- 
sions are in classes NP opt and P op t respectively). We observe that at the structure level, a 
problem in P, Maximum Matching, and an NP-complete problem, Maximum Independent 
Set, are identical (same signature and same expressions for the properties that need to be 
satisfied). 

4 In general, it is the combination of the BFC and OFC that makes a problem easy or difficult to solve, not 
just one of them. 

5 Of course, we fully acknowledge contributions in the literature that provided machine level expressions — 
but that is only one side of the story. 
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This means that ESO IIi sentences are unable (too weak) to distinguish between P and 
NP-complete problems at the structure level. 

Section [3] We provide an expression for the OFC when the arity of the predicate measuring 
the objective function value is one. This sentence is in ESO IIi form; however, it is not a 
Horn sentence. (The OFC cannot be expressed as an ESO IT Horn sentence; see Remark 

El) 

Section HI Theorem [20] is the main contribution of this paper. 

For the class P, under ordered structures, we will demonstrate that machine level expressions 
are more powerful. Every property in P can be expressed at the machine level in ESO Hi Horn 
logic (this was proved in [?]), but there is at least one property in P, Maximum Matching, 
for which this is not true at the structure level (proved here). 

Assuming that P ^ NP, we will show that another problem decidable in PTIME, finding a 
solution for Vertex Cover within an approximation bound of two, cannot be expressed in ESO 
Hi Horn logic. 

We also provide a proof using direct products that the upper bound form of the OFC (\F\ < 
K) cannot be expressed in ESO Hi Horn logic (Theorem 1 2 7[) . 

Section [5] (Implications for the class NP): Under ordered structures, at the structure level, 
we observe that an important subclass of NP cannot be expressed in ESO logic where the 
first order part is a universal formula. Also at the structure level, it is still an open problem 
as to whether ESO logic characterizes the class NP. 

Section [6] (LFP logic): The results in this section show that least fixed point logic with count- 
ing (LFP+C) can express problems in P at the machine level, but insufficient to capture this 
class at the (more important) structure level. 

We only work with structure level expressions in this paper. 

2 Expressions for decision problems 

2.1 Expressions at the machine level and the structure level 

In this subsection^, we distinguish between logically expressing a problem at the (lower) 
machine level and the (higher) structure level. In Section H] we will show that there is a 
gap (hierarchy) between these two levels for certain problems in class P, if the problem is a 
decision version of an optimization problem. 

Machine level expressions (MLE). These express the computation of a property (in a 
Turing machine, for instance). Here, computation of a property is defined as checking whether 
a property is true using a computation device, for a given input. 

Such an expression encodes the computation steps of a Turing machine (TM), for example, as 
in the proof by Fagin [5] that ESO logic captures the class NP and the proof by Immerman 
[T2] that LFP (under ordered structures) captures the class P. These expressions state that 
(i) if we go through the steps of a computation device such as a TM, and 

6 We thank Anuj Dawar for his input here. 
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(ii) if the TM finally reaches an accepting state, 

then the property must be true for the given input structure. 

For example, to write a machine level expression for Hamiltonian Cycle (HC), we should first 
design a TM that solves HC, then encode the steps of the TM in an expression. 

Structure level expressions (SLE). On the other hand, structure level logical expressions 
do not express the computation of a property; rather, they directly express the truth of a 
property of a mathematical structure. No computation is involved in such a description. 

(From a mathematician's viewpoint, SLE's are the natural choice to express a problem/property, 
not the MLE's. To logically express a property of a given mathematical structure, what is 
the need to put it through computation first?) 

Properties. What is a property? Examples of properties would include: 

(i) existence of a Hamiltonian cycle (the input is a graph), or 

(ii) satisfiability (the input is a formula in CNF), or 

(iii) the existence of a matching of size at least K (the input being a graph and the number 
K). 

For the purposes of this article, we formally define a property as: 
Definition 9. A property is an isomorphism- closed class of finite structures. 

In terms of a formula r, it can be defined as: 

Definition 10. A property V is defined by a formula r as long as the following is true: 
given an (input) structure A, A \= r iff A has property V . 

The above definition of a property fits in very well with structure level expressions, because 
a formula is exactly the structure level description of a property. 

Definition 11. A decision problem P(V,A) checks whether property V is satisfied by a 
structure A. IfV is defined by a formula r, then P(V,A) checks whether A \= r. 

Henceforth, problem will refer to a decision problem unless we state that it is an optimization 
problem. 

Machine level expressions for the class NP (class P) were provided by Fagin [5] (Gradel [7]) 
respectively. (The expressions for P assume the presence of a successor relation as a part of 
the first order vocabulary.) 

In this article, we show that for structure level expressions, ESO Hi Horn is insufficient to 
capture the class P even when the structures are assumed to have a linear order, if the 
property represents the decision version of an optimization problem. 

The difference between the expressions for machine level and the structure level has been 
pointed out earlier. See, for instance, Abiteboul, Vardi and Vianu pp. Henceforth, we work 
with structure level expressions in this article. 
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2.2 Signature for the input structure 

The standard signature for graphs is expanded to accommodate the expressions for the single 
OFC in each problem. The BFC for each problem will be illustrated in Sees. I2.3H2.5I 

A three-sorted input structure (V, K, N) will be used. V will represent the set of vertices, K 
the lower bound on the objective function value, and N represents the set of natural numbers 
< % < K . We need two switch relations num{x) and vertex{x); then num(x) is true iff 
x £ N, and vertex(x) is true iff x £ V. Then a variable x has the interpretation x^ = K 
by a structure A iff A \= ^num(x) A ->vertex(x). 

Since we work with ordered structures, we assume that V < K < N and hence we only 
consider structures A such that 

• A \= VnVf [vertex(u) A ->vertex(v) A ->num(v)\ — >• (u < v) 

(This means, if u is a vertex and v is interpreted to be the lower bound K, then u < v); 
and 

• A \= VnVf [num(v) A —>vertex(u) A —mum(u)] — > (u < v) 
(If v 6 N and u is interpreted as K, then u < v). 

The sub-domains V and iV are assumed to be ordered. As usual, the binary relation E 
represents edges of the graph. 

We need a few constants: min and max refer to the first and last elements in V respectively. 
We introduce another constant called "base" to the input vocabulary, such that base < min, 
where min is the "first" vertex. These constants will be used in Sec. [3] (appendix). 

2.3 Maximum Clique: Decision Version 

Definition 12. Maximum Clique problem (decision version). 

Given a graph G = (V, E) and a constant K , is it possible to mark vertices in V in such a 
way that 

(i) BFC: For any x,y E V where x < y, if x and y are marked, then (x,y) € E, and 

(ii) OFC: The number of marked vertices is at least K ? 

(A note about K. Readers may ask, "How is K represented in the input? in binary or 
unary?". Our answer: This may be relevant for problems with large data, such as Maximum 
Flow or Minimum Cost Flow, where edge weights are part of the input. However, the issue 
is irrelevant for problems such as Maximum Clique where the solution value lies in the [0, n] 
range, where n is the number of vertices. If K > n, then the solutions are immediately 
disqualified — there cannot be a solution to the Maximum Clique problem with a clique size 
more than the number of vertices. If K < n, the input size for K is at most that of n, whether 
in binary or unary.) 

For simplicity, assume the following: 

Assumption 13. K <n, where n is the number of vertices in the input graph (or in general, 
the domain size). 
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Let F be the set of marked vertices. We can then express the problem in ESO (existential 
second order) logic: 

= 3F (</>! A <f> 2 A 3 ), where (2) 

01 = ViiVv [F(u) A F(v) A (u < v)] — > E(u,v), 02 = Vu> .F(u>) — > vertex{w), (3) 

and 03 is an ESO sentence which captures the OFC, which is \F\ > K (the lower bound on 
the objective function value). Also, 02 requires that w £ V (that is, w ^ K and w ^ N) if 
it is to be counted towards the objective function. 

01 captures the BFC; it says that if two vertices are marked, then there is an edge between 
them in the input graph G = (V, E). 

Rewrite the above in ESO LTi Horn form as: 

= 3F (0i A 2 A 03 ), where (4) 

01 = VuVv ->F(u) V ->F(y) V -*(u < v) V E(u, v), 02 = -*F(w) V vertex(w). (5) 

Of all the disjuncts in ([5]), except for F(u), F(v) and F(w), the rest are first order; their 
truth values can be computed from the input and substituted into the formula. 

Remark 14. The Horn condition applies only to the second order predicate F and to other 
second order predicates created to capture the OFC (\F\ >K). 

The above remark also applies to the OFC's for Maximum Matching and the Maximum 
Independent Set problems (see the next two subsections). 

Remark 15. The OFC, \F\ > K, cannot be captured in ESO Yi\ Horn logi$\, even in 
the presence of ordering. (Thanks to Anuj Dawar and Wilfrid Hodges for communicating 
this information.) This can be shown using the fact that universal sentences are preserved 
under substructures. Also note that if a property fails to hold under the ordered structures 
assumption, it fails to hold under the "successor relation" assumption as well, since the former 
assumption is stronger and subsumes the latter. ■ 

As we see below, the BFC sentences for Maximum Clique (an NP-complete problem) and 
Maximum Matching (a problem in P) are almost identical. They become identical after we 
substitute the truth values for the first order predicates. 

2.4 Maximum Matching: Decision Version 

For this problem, we find it more convenient to let the universe for the input structure be 
the edge set E. We use a binary vertex relation W(u, v) to indicate that there is a common 
vertex between edges u and v. Hence we represent a graph by G = (E, W) where E is the 
edge set and W is the binary relation between the edges just described. We call this the line 
graph version of Maximum Matching. 

Another way to think about this: G*2 = (E, W) is the line graph of a given graph G\ = (V, E). 
In general, \W\ need not be equal to |V|; however, \W\ is bound by a polynomial in \V\. 

7 Indeed, if the OFC can be expressed in ESO IIi Horn, then it would mean that the decision version of 
Maximum Clique can also be expressed in ESO IIi Horn, hence this problem would be polynomially solvable. 
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Given a line graph G 2 , can the original graph G\ be uniquely re-constructed? The answer is 
yes, as long as |V| > 5. This result is due to Whitney [18] (communicated to us by Sanming 
Zhou). 

Due to Assumption [SI we let 

"o = [W(u,v) A vertex(u) A vertex(v)] — > (u < v). (6) 

Note: Though in reality, u and v are edges, we still let vertex(u) = vertex(v) = TRUE so as 
to use the signature defined at the beginning of this section. 

Definition 16. Maximum Matching, decision problem (line graph version). 

Given a graph G = (E,W) and a constant K, is it possible to mark (or match) edges in E 
in such a way that 

(i) BFC: For any two edges u,v € E, u < v, if u and v are adjacent, i.e. W(u,v) is true, 
then at least one of them must not be marked, and 

(ii) OFC: The number of marked edges is at least K? 

Let F be the set of marked (or matched) edges. We can express the problem in ESO logic: 

a = 3F (ctQ A Qi A 02 A 03), where (7) 

ai = VuVi; [F(u) A F(v) A (u < v)] ->■ -W(u, v), a 2 = Vu> F(w) -4 vertex(w), (8) 
and «3 is an ESO sentence which captures the OFC, which is \F\ > K. 

ct\ states that if edges u and v are matched, they do not share a common vertex. Note that 
ao is a first order expression, hence its truth value can be substituted into a. 

We can rewrite the above in ESO IT Horn form: 

a = 3F (ao A a\ A 02 A 0:3), where (9) 

a\ = VuV-u -*F(u) V ->F(v) V -i(u < v) V ->W(u, v), a 2 = ~^F(w) V vertex(w). (10) 

Of all the disjuncts in ([T0|) . except for F(u), F(v) and F(w), the rest are first order; their 
truth values can be computed from the input and substituted into the formula. The Horn 
condition applies only to the second order predicate F and to other second order predicates 
created to capture the OFC (\F\ > K). 

2.5 Maximum Independent Set: Decision version 

Definition 17. Maximum Independent Set or MIS (decision version). 

Given a graph G = (V, E) and a constant K , is it possible to mark vertices in V in such a 
way that 

(i) BFC: For any x,y £V where x < y, if x and y are marked, then (x,y) £ E, and 

(ii) OFC: The number of marked vertices is at least K ? 



9 



As we will observe, at the structure level, MIS and the line graph version of Maximum 
Matching are one and the same problem once the truth values of the first order predicates 
are substituted. 

Let the domain be the set V of vertices and let relation E define the set of edges. (That is, 
we represent the given graph, not the line graph.) We express the MIS problem in ESO logic: 



/3i = VWi> [F{u) A F{v) A (u < v)] — > ->E(u, v), /3 2 =\/w F(w) -4 vertex(w), (12) 
and /?3 is an ESO sentence which captures the OFC, which is \F\ > K. 
Rewrite the above in ESO LTi Horn form: 



/3i = ViiVi> -iF(u) V -iF(v) V -i(u < v) V ->E(u, v), f3 2 = Vw -iF(w) V vertex(w). (14) 

We note that the BFC for the Maximum Independent Set (MIS) problem is exactly in the 
same format as (fl~0|h once W is replaced by E. 

The second order predicate for the MIS problem is F, and E is a first order relation. 

Discussion. Yet again we observe a strong resemblance between the BFC expressions for 
an NP-complete problem (Maximum Independent Set) and a problem solvable in polynomial 
time (Maximum Matching). Recall that Maximum Clique is also NP-complete. 

After a transformation to the line graph in the case of Maximum Matching: We observe 
that the BFC expressions ([5]), (jlOp and (|14p are very similar. What matters the most, the 
expressions involving the S.O. predicates are the same. After we substitute the truth values 
for the first order predicates, the three expressions become identical. 

In a model- checking problem, we are given a structure A and a formula (f), and we are asked 
whether A \= <fi. 

Remark 18. At the structure level, MIS and the line graph version of Maximum Matching 
(Def. \16\) are one and the same problem. They have the same input signature and identi- 
cal ESO expressions ifP HlOj) and 1 13[\14\ ). Hence the model- checking problem related to the 
problems in Def. [Td\ and [77] are the same. 

And coming to the OFC: 

Remark 19. The OFC expression will be identical for all the three problems considered so 
far, since the arity of F is the same (i.e. unary) in all three cases. 

All three problems can be expressed as a conjunction of their respective BFC and OFC. From 
this, we can conjecture that Maximum Matching can be expressed in ESO H± Horn Logic if 
and only if we can do the same for Maximum Independent Set and Maximum Clique. 

However, in Section HI we will show that Maximum Matching cannot be expressed as an ESO 
Hi Horn sentence. (The inexpressibility of the other two problems in ESO Hi Horn can be 
proved similarly.) 

In addition, we will show that another polynomially solvable decision problem, finding a 
solution for Vertex Cover within an approximation bound of two, cannot be expressed in 
ESO Hi Horn logic (this proof assumes that P ^ NP). 



P = 3F (fa A fa A /3 3 ), where 



(11) 



P = 3F (/3i A /3 2 A /3 3 ), where 



(13) 



10 



3 Expression for the Cardinality Constraint |F| > K 

In this section, we derive an ESO expression for the cardinality constraint \F\ > K. The 
construction given here was suggested by Neil Immerman |14j . 

I is a second order binary relation, from numbers to elements of the original domain A. 
That is, L : N — > A. Assume that iV = {0, 1,2, ••• , n}, where n = \A\, the number of 
vertices in the input graph. 

7/i = Vt/Vt; L(y, v) — > [F(v) A num(y)] = \/y\/v -<L(y, v) V [F(v) A num(y)] 
= Vy Vt; [-*L(y, v) V F(v)} A [-iL(y, v) V num(y)}. 

[L is a binary relation. For L, the first argument x £ N, and the second 
argument y G F.] 
7/2 = Vtj -F(t') — > vertex(v) 

= Vy ^F(tj) V vertex(v). [as in ©, ([12]) and (fI3|)] 

L is not a function; it is possible that L(x, u) and L(x, v) are true when u ^ v. But the 
inverse of L is a function; given u £ A, it reverse maps to a unique x, in L(x,u). That is, 

7/3 = MxiyMu [L(x,u) A L(y,n)] — >• (x = y). (15) 

(Every u £ A has a unique index i£JV.) 

Let the first order relations succN and succA define succession within iV and A respectively. 
The Horn condition will not apply to these. Add the following 2 clauses to the expression of 
the cardinality constraint: 

7/4 = L(0, base) A SuccA(base, min). (16) 

From here on, the forward recursion will take over and build all the correct L's. The incorrect 
L's will be established false by backward recursion. 

L is a second order relation; it scans the entire domain A from base to max; as it scans A, it 
keeps a running count of the number of elements in A that are also members of F. (Note 
that base has been assigned an index of zero in (|16p .) 

Forward Recursion: 

775 = VxVyVnVT; [SuccN (x, y) A SuccA(u, v) A (u < max) A L(x, u) A F(v)\ — > L(y, v) 

= VxVyVuVv ~^SuccN(x, y) V -iSuccA(u, v) V (u > max) V -<L(x, u) V -<F(v) V L(y, v). 

(17) 

If v 6 F, then u and v have different indices (x and y respectively). Otherwise, they share 
the same index (x). 

7/6 = VxVyVTiVf [SuccN(x, y) A Stjcc^u, f) A (u < max) A L(x, u) A — > L(x, v) 

= \/x\/y\luiv -^SuccN(x, y) V ^SuccA(u, v) V (n > max) V -*L(x, u) V i ? (f) V L(x, u). 

(18) 
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Backward Recursion: 

r/7 = VxVyVwVu [SuccN(x, y) A SuccA(u, v) A (v > rain) A L(y, v) A F(v)] — > L(x, u) 

= VxVy V-uVtj ^SuccN(x, y) V -*SuccA{u, v) V (v < rain) V ~*L(y, v) V ~^F{v) V L(x, u). 

(19) 

If u £ -F, then u and v have different indices (x and y respectively). Otherwise, they share 
the same index (y). 

rjs = VxVyVitW [SuccN(x, y) A SuccA(u, v) A (v > rain) A L(y, v) A -i-F(i>)] — > L(y, u) 

= Mxiy \/u\/v ^SuccN(x, y) V ^SuccA(u, v) V (v < rain) V ^L(y, v) V -F(u) V L(y, u). 

(20) 

The index of max is at least k: 

7/9 = Vx L(x, max) — > (x > k) = Vx ->L(x, max) V (x = fc) V (x > k). (21) 

The first order variables in each of the nine n expressions can be renamed so that the variables 
in each expression is distinct. Thus the complete expression for the OFC is 

9 

3L /\ m . (22) 

i=l 

As mentioned in Remark 1191 the OFC expression above can be used for the three problems 
in Sec. [2j Again, as mentioned before, this is an ESO expression where the first order part is 
universal; however, it is not Horn, since expressions (|18p and (I20p are not. 

4 Some PTIME decidable properties cannot be expressed as 
ESO 111 Horn sentences at the structure level 

Under ordered structures, let V\ (V2) be the class of ptime decidable properties that can be 
expressed in ESO IIi Horn logic at the machine (structure) level respectively. 

In [7], it was shown that V2 Q V\ = P. Actually in [7], the author assumes the "successor 
relation" assumption, not ordered structures. However, if a property holds good under the 
former assumption, it also holds good under the latter (see Remark 1 15|) . 

In this section, we show that there are properties in P that cannot be expressed in ESO Hi 
Horn logic at the structure level. That is, we will prove that 

Theorem 20. ViC.V\. 

Theorem [20] can also be proven using the "preservation under substructure" property men- 
tioned in Remark [15j However, we will give two other proofs as follows: 

(i) One using reduced products, by showing that the decision version of Maximum Matching 
(a ptime decidable property) cannot be expressed in ESO Hi Horn at the structure level; 

(ii) Another using approximability theory as applied to the Vertex Cover problem. This proof 
assumes that P 7^ NP. The question of whether there exists a feasible solution to a Vertex 
Cover instance, whose value is within a factor of two of the optimal solution value, is ptime 
decidable. 
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4.1 Proof based on reduced products 

We show that Maximum Matching (decision version) lies in V\\Pi-, which will prove Theorem 

M 

The proof here is adapted from the one given by Dawar [3J, which showed that the cardinality 
constraint \F\ > K cannot be expressed by a Horn formula, when F is an existentially 
quantified second-order predicate. F is unknown; it is not a part of the input. Recall that K 
is the lower bound on the objective function value. 

The proof relies on the fact that universal Horn formulae are preserved under reduced prod- 
ucts. That is, if ip is a universal Horn formula and structures A and B satisfy ip, then so 
does their reduced product A.B. 

Lemma 21. Maximum Matching (decision version) cannot be expressed in ESO Ii\ Horn at 
the structure level, and hence this problem is not a member of V\ . 

Proof. Suppose we have a signature divided into two parts: 

A set of relation symbols £ = {Si, ■ ■ ■ , S m , F}, and a separate set p of relation and constant 
symbols. 

The relations in p are first order (input), hence the Horn restriction does not apply to these. 
On the other hand, the relations in S are second order (quantified), and hence the Horn 
condition applies to these relations. In particular, it applies to F. 

Let A and B be two input structures on the same universe {1, • • • ,29} [9 G N) with a 
linear ordering; furthermore, for all symbols in the signature p, let A and B agree on the 
interpretation of the symbols; however, they may differ on relations in S. 

Let us now define a product structure (a reduced product) A.B to be the structure on the 
domain {1, • • • , 29} with a linear ordering, in which every symbol in the signatures X and p is 
interpreted as the intersection of the two interpretations in A and B. Then it is easy to show 
that any Horn formula (ft that is satisfied in both A and B is also satisfied in A.B. (This 
is exactly by the argument that shows that Horn formulae are closed under direct products; 
for example, see Chapter 9 (Page 417) of the book by Wilfrid Hodges [10], or Page 493 of his 
book chapter [TT].) 

Then suppose we let A and B be structures with, say, 29 elements {1, • • • , 29}; In A, we let 
F consist of the first 9 elements {1, • • • ,9}; and in B, we let F consists of the last 9 elements 
{9 + 1, • • • , 29}; Also, we let K = 9 in both structures. So, the property li \F\ > K" is true in 
both A and B. That is, there exists a matching of size at least 9 in A and B. 

However, ll \F\ > K" is false in A.B, since in this structure, F is empty, even though the 
lower bound K is still equal to 9. 

It follows that the constraint \F\> K cannot be expressed as a Horn formula, and hence we 
conclude the decision version of Maximum Matching, a ptime solvable problem, cannot be 
expressed in ESO Hi Horn at the structure level. □ 

This proves Theorem l20l 
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4.2 Proof based on approximation theory 



is the 



Here we provide another proof of Theorem [20l under the assumption that P ^ NP. We 
use approximation theory to show that some decision problems derived from NP-hard op- 
timization problems cannot be expressed in ESO IIi Horn logic. We need a few definitions 
first. 

Given an NP-optimization problem Q (see Def. [3|), we define the following: 

Definition 22. Approximation ratio. Given an instance I of Q and a feasible solution 

S £ Fq(I), the approximation ratio is defined as — ~ f or minimization problems and 

J^~JJ f or maximization problems. We assume that /q (/) > ; where /q° (-0 
optimal solution value to instance I. 

Definition 23. Approximation problems Aq and Aq. 

Problem Aq. Given an instance I of problem Q and a parameter Kq that depends on Q, is 
there a feasible solution S 6 Fq(I), such that the approximation ratio is at most Kq? 

Problem Aq. Given Q, I and a parameter Lq that depends on Q, is there a feasible solution 
S € Fq (I) , such that the approximation ratio is at most Lq ? 

(Lq < K Q .) 

The APX class of NP-hard optimization problems is defined as follows: 

Definition 24. The APX class. Consider a problem Q from the class NP op t. Then Q is 
a member of the class APX iff the decision problem Aq can be solved in ptime. 

Of course, if Aq can be solved in ptime and Kq = 1, then Q E P pt- 

Example: Steiner Tree is a problem in APX, with parameter Kq = 2. The optimization 
version of this problem can be described as follows: 

Given : A graph G = (V,E), a subset S of V and edge weights w : E — > N. 
To Do : Find a subgraph T of G such that 

(i) The BFC: T includes all the vertices in S, T is a tree, and 

(ii) The objective function: The sum of the weights of the edges in T is to be minimized. 

Hence given an instance / of the Steiner tree problem, there is a ptime algorithm which can 
decide whether there is a feasible solution to / with an approximation ratio of at most two. 

However, Kq is an upper bound on the approximation ratio; we know that there is a ptime 
algorithm that guarantees a feasible solution within this upper bound. However, many prob- 
lems (such as Vertex Cover, below) in the class APX also have a proven lower bound Lq. For 
such problems, the approximation problem Aq is known to be NP-complete. 

Definition 25. Vertex Cover (optimization version). 

Given : A graph G = (V,E). 

To do : Mark vertices in V in such a way that 

(i) BFC: For any x,y £V where x < y, if (x, y) G E, then either x of y is marked, and 

(ii) The objective function: The number of marked vertices is to be minimized. 
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For Vertex Cover, the upper bound K vc = 2 and the lower bound L vc = 7/6 (see [9]]j. In 
fact, for L vc , one could use any number 7 such that 1 < 7 < 7/6. For ease of representation 
in an ESO formula, let us use 7 = 1.1. 

Let A„ c and A^ c be the approximation problems Aq and Aq respectively (from Def. |23|) . 
applied to Vertex Cover. 

Ay C and A^ c are members of the class NP; in particular, A^ c is a member of the class P, and 
A^ c is NP-complete. 

Then it follows that A„ c can be expressed as an ESO LTi Horn sentence <\>\. This sentence will 
include a sub-formula stating that the approximation ratio a(I, 5) is at most two (a(I, S) < 
2). 

However, if we replace the formula (for the condition) a(I, S) < 2 with the formula for 
ot{I, S) < 7 = 1.1, we obtain a new ESO Hi Horn sentence <p2 which expresses A^ c . 

Though Ay C is known to be NP-complete, we have just obtained an ESO Hi Horn sentence 
(</>2) for this problem, implying that A^ c can be decided in ptime. (See Theorem [71) This 
contradicts our assumption that P 7^ NP. 

(If P = NP, then K Q = L Q = 1.) 

Hence we have shown the following: 

Lemma 26. Assuming that P 7^ NP, A^ c , a problem solvable in ptime, cannot be expressed 
as an ESO Hi Horn sentence at the structure level; that is, A^ c € V\\P2- 

This proves Theorem 1201 under the assumption that P 7^ NP. 
Here is another example: 

Linear Programming (LP). In the case of the decision version of LP, every constraint, whether 
OFC or BFC, is of the form Y2j a ij x j — &i or Ylj a ij x j — ^ (where the Ojj's and the b^s are 
constants), that is, checking the cardinality of the summation on the left side. From Remark 
[T5l we know that such constraints cannot be expressed in ESO Hi Horn form at the structure 
level. Hence LP is not a member of the class Vi- This observation can be generalized to other 
Mathematical Programming problems such as Convex Programming where the constraints are 
of a more general form such as /(x) < K (where K is a constant). 

4.3 \F\ < K cannot be expressed as a universal Horn formula 

Previously we saw that the lower bound \F\ > K cannot be represented in ESO Hi Horn. 
We can show a similar inexpressibility result for the upper bound {\F\ < K) using direct 
products. 

The proof relies on the fact that universal Horn formulae are preserved under direct products. 
That is, if ip is a universal Horn formula and structures A and B satisfy tp, then so does their 
direct product A x B. 

Consider identical input structures A and B, with domain as {1, 2, • • • , 26}. Let every 
relation in A and B be interpreted in the same way, including F (the objective function 
relation). 

8 Johan Hastad has been awarded the 2011 Godel Prize for this publication. 
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Let F iA) = F iB) = {!,■■■ ,9}. Hence \F^ \ = \F^ \ = 9. Let K = 9 > 1. 
In the direct product A x B, the domain |A x B| = {(1, 1), • • • , (29, 29)}. 
Then F^ AxB ^ = {(1, 1), • • • , (9, 9)}. That is, \F^ AxB ^ \ = 9 2 >K. 

Hence the upper bound on the objective function, that \F\ < K, is not preserved in the 
product structure. We can conclude that 

Theorem 27. The upper bound form of the OFC, \F\ < K , cannot be captured in ESO H\ 
Horn logic. 

This would be applicable to decision problems in P that are derived from minimization 
problems. 

5 Implications for the class NP and Fagin's theorem 

Under ordered structures, let Vj, (V4) be the class of NP-decidable properties that can be 
expressed in ESO logic where the first order part is a universal formula at the machine 
(structure) level respectively. 

It is known that C V3 = NP. See Chapter 7 in [13] or Chapter 3 in |S]. 

However, no problem in NP derived from a maximization problem can be expressed with a 
universal first order part, since the OFC cannot be so expressed. 

For the problem as such (considering the BFC and the OFC), it will violate the "preservation 
under substructure" property of universal formulae. For a given structure A and lower bound 
K, if it satisfies the BFC and the OFC, we can consider a substructure that is small enough 
which will violate the OFC. It follows that 

Theorem 28. C V 3 . 

It is still open whether Fagin's theorem (that ESO logic characterizes the class NP) holds at 
the structure level when the input structures have no ordering. 

6 Structure level and least fixed point 

The Immerman-Vardi theorem (see [8], page 70), which states that least fixed point logic 
(LFP) characterizes P for the class of ordered finite structures, is true at the machine level. 
We now investigate whether it holds good at the structure level. 

Using results from Atserias et al [2], we can show that least fixed point logic with counting 
(LFP+C) is also insufficient to capture ptime at the structure level. Consider the following 
problem: 

Definition 29. feasible. Given a matrix A G Z™ xn and a vector b € Z?p, does the system 
of equations AX = b have a solution X € Z?> ? 

Under ordered structures, let V5 (V%) be the class of ptime decidable properties that can be 
expressed in LFP+C at the machine (structure) level respectively. 
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Theorem 30. (Immerman-Vardi theorem) V§ = P. 

Coou is the logic obtained from atomic formulae with negation, infinitary conjunctions and 
disjunctions and counting quantifiers 3- 1 , where i is a non-negative integer. is a- fragment 

of Cocxj with only k distinct variables in the formula. is defined as: 

^oou = U ^ooui- (23) 

feasible can be decided in ptime [2]. It was also shown in [2] that feasible is not 
expressible in C^. LFP+C is a fragment of C£^; any property that can be expressed in 
LFP+C can also be expressed in C^ w . 

Since feasible, a problem in P, cannot be expressed in C^^,, it cannot be expressed in 
LFP+C either. From this, we conclude that LFP+C is insufficient to capture ptime at the 
structure level. That is, 

Theorem 31. V e C V 5 . 

7 Conclusions 

In the last 40 years, we have seen important contributions that provided machine level ex- 
pressions [5j[T2]. However, as mentioned in Sec. 11.21 Descriptive Complexity is a much more 
useful tool at the structure level than at the machine level. 

For a large subset of the class P (those problems derived from optimization), the usefulness 
of Descriptive Complexity is lost when it comes to recognizing their ptime decidability using 
expressibility in ESO Fix Horn logic. 

In the case of ESO Hi Horn logic, we were able to demonstrate a separation between machine 
level and the structure level — the machine level can capture P, whereas the structure level 
cannot. Hence we need to investigate other logics that can close this gap. The inexpressibility 
proofs in Section [4] also hold good when the input structures are ordered. 

The Immerman-Vardi theorem is true at the machine level, but not at the (more important) 
structure level. Also at the structure level, it is still open whether Fagin's theorem holds 
when the input structures are unordered. 

Another interesting thing we have observed is, the BFC expressions for some NP-complete 
problems are exactly identical to those for some problems in P (Remark [18]) • Considering this 
fact and Remark 1191 about the OFC, we conclude that ESO universal sentences are unable to 
distinguish between P and NP-complete problems at the structure level. 
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